SemanticScuttle - klotz.me » klotz: machine learning+computer science

Machine learning for email spam filtering: review, approaches and open research problems

"We present a systematic review of some of the popular machine learning based email spam filtering approaches."

"Our review covers survey of the important concepts, attempts, efficiency, and the research trend in spam filtering."

2024-09-18 Tags: computer science, privacy, machine learning, spam, anti-, filtering, deep learning, neural networks, support vector machines, naive bayes by klotz

Reducing Transformer Key-Value Cache Size with Cross-Layer Attention

This paper introduces Cross-Layer Attention (CLA), an extension of Multi-Query Attention (MQA) and Grouped-Query Attention (GQA) for reducing the size of the key-value cache in transformer-based autoregressive large language models (LLMs). The authors demonstrate that CLA can reduce the cache size by another 2x while maintaining nearly the same accuracy as unmodified MQA, enabling inference with longer sequence lengths and larger batch sizes.

2024-05-26 Tags: transformer, autoregressive language models, key-value cache, attention, multiquery attention, cross-layer attention, machine learning, computer science, llm, mit, csail by klotz

A Deep Learning Approach to Data Compression – The Berkeley Artificial Intelligence Research Blog

2019-09-20 Tags: compression, entropy, lossless, deep learning, data, computer science by klotz

How to analyze “Learning”: Short tour of computational learning theory

2018-10-31 Tags: machine learning, computer science by klotz

SemanticScuttle - klotz.me

klotz: machine learning* + computer science*

Linked Tags

Related Tags